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Starting from an interpretation of the classical-quantum correspondence, we 
derive the Dirac equation by factorizing the algebraic relation satisfied by the 
classical Hamiltonian, before applying the correspondence. This derivation 
applies in the same form to a free particle, to one in an electromagnetic field, 
and to one subjected to geodesic motion in a static metric, and leads to the 
same, usual form of the Dirac equation—in special coordinates. To use the 
equation in the static-gravitational case, we need to rewrite it in more gen¬ 
eral coordinates. This can be done only if the usual, spinor transformation of 
the wave f un ction is replaced by the 4-vector transformation. We show that 
the latter also makes the flat-space-time Dirac equation Lorentz-covariant, 
although the Dirac matrices are not invariant. Because the equation itself 
is left unchanged in the flat case, the 4-vector transformation does not al¬ 
ter the main physical consequences of that equation in that case. However, 
the equation derived in the static-gravitational case is not equivalent to the 
standard (Fock-Weyl) gravitational extension of the Dirac equation. 
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1 INTRODUCTION 


Dirac’s equation for a relativistic particle is unanimously recognized as a 
historical step, although it is now known [Tj that several arguments promot¬ 
ing the Dirac equation against the then competing Klein-Gordon equation 
were not conclusive in view of the later developments. The main argument 
in Dirac’s first paper on his equation [2; was that according to which “the 
wave equation [should be] linear in W [the energy] or d/dt, so that the wave 
function at any time determines the wave function at any later time.” But any 
system of partial differential equations may be rewritten (in many ways) as a 
first-order system, e.g. by introducing partial derivatives of the unknowns as 
new unknowns. The Klein-Gordon equation may indeed be rewritten in the 
Schrodinger form vindicated by Dirac, by introducing a 2-component wave 
function JJ). Another argument noted that the time component of the Dirac 
current is positive, whereas that of the Klein-Gordon current is not and hence 
does not allow a probability interpretation. In the mean time, however, it 
has been found that the localization of quantum relativistic particles must 
appeal to different coordinate operators [3]: therefore, “in this respect the 
Klein-Gordon equation is neither better nor worse than the Dirac equation” 
[I]. It is now accepted that the Klein-Gordon equation describes spin-0 par¬ 
ticles, whereas Dirac’s applies to spin-1/2 particles. 

Since the Dirac equation remains extremely important today, the 
derivation of this equation is an important point. The textbooks {e.g. Bjorken 
and Drell U, Schulten j|) follow closely Dirac’s original derivation based on 
the factorization of the Klein-Gordon operator. There are also derivations 
{e.g. Ryder [7]) which assume notions such as spin and spinors, and which 
therefore, one might say, start from knowledge which Dirac’s equation has 
the advantage of leading to naturally. On the other side, we find derivations 
which are based on an extraneous framework, quite remote from the problem 
solved by Dirac: Srinivasan and Sudarshan 0 introduce quaternion measur¬ 
able processes, i.e., families of quaternion measures; Close 0 shows that 
a Dirac-type equation describes torsion waves in an elastic solid; Celerier 
and Nottale JTQ| start from scale relativity with its fractal space-time. In 
the present paper, the Dirac equation will be obtained more directly from 
the Hamiltonian of a classical relativistic particle. One advantage is that 
the same method, and indeed essentially the same equation, now applies to 
the cases of: a free particle; a particle in the electromagnetic held; and a 
particle in a static gravitational held. The gravitational Klein-Gordon equa- 
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tion has previously been obtained by this method m- Another point is 
that the method is based on an interpretation mma of the correspondence 
between a classical Hamiltonian and a quantum wave equation, which may 
be considered as a justification of that correspondence—whereas the latter 
is usually taken as an axiom. The fact that the derivation based on this 
interpretation/justification applies also to the gravitational case means that 
this approach to writing quantum wave equations in a gravitational field is 
more direct than the usual approach. The latter is inpired by Einstein’s 
equivalence principle and aims at writing a covariant equation that coincides 
with the flat-space-time version if the coordinate system is such that, at the 
event considered, the metric is Minkowskian and the connection cancels. The 
application of this standard approach may be ambiguous, e.g. because co¬ 
variant derivatives do not commute. Thus, the covariant generalization of the 
Klcin-Gordon equation depends on an arbitrary parameter which multiplies 
the curvature scalar PI- In the case of the Dirac equation, the standard ap¬ 
proach leads to the equation independently proposed by Fock T4] and Weyl 
El, but this followed only after a rather involved analysis—as one may re¬ 
alize from the historical account m, as well as from the derivation of this 
extension of Dirac equation to curved space-time PI- 

This complexity of the standard extension of the Dirac equation to 
the gravitational case is related to the special transformation law that is used 
for the Dirac wave function, namely, the spinor transformation. Especially 
in a curved space-time, spinor representations are related with elaborated 
differential-geometrical concepts. However, one may ask if this mathemati¬ 
cal sophistication is both absolutely necessary, and physically adequate. In 
the present paper, it will be argued that the spinor transformation results 
from a contingent interpretation of the relativity principle in the context 
of the Dirac equation: another interpretation allows one to use the stan¬ 
dard 4-vector transformation instead. Moreover, it will be found that, for 
the proposed version of the gravitational Dirac equation, only the 4-vector 
transformation can be used. The new interpretation of the relativity principle 
for Dirac’s original (flat-space-time) equation does not change the physical 
consequences of the latter, at least not the direct ones. This is because these 
consequences, such as the emergence of spin and the precise prediction of the 
energy levels of hydrogen-type atoms, are those of the equation itself (see e.g. 
Schulten |6J), instead of being consequences of its transformation law under 
a Lorentz boost. However, the new derivation of the gravitational Dirac 
equation does lead to a different equation as compared with the standard 
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version—this was already the case for the Klcin-Gordon equation HU. 


2 ASSOCIATING A QUANTUM WAVE EQUA¬ 
TION WITH A CLASSICAL HAMILTO¬ 
NIAN 

Here, the results of a previous work [I2J will be summarized and extended. A 
very important and rather mysterious feature of quantum mechanics (QM) 
is the classical-quantum correspondence, that associates a linear differen¬ 
tial operator with a classical Hamiltonian, and that leads to regard en¬ 
ergy and momentum as operators of this kind. The attempt |T2| at in¬ 
terpreting the classical-quantum correspondence starts from remarks made 
by Whitham j!8j in the context of the theory of classical waves. Essentially, 
for any linear wave equation, one may define different “wave modes,” each of 
which is characterized by a dispersion relation , i. e. an explicit dependence of 
the frequency uasa function of the spatial wave (co-)vector k, u> = W (k; A"). 

(In the general case of heterogeneous propagation, the dispersion depends in¬ 
deed on the space-time position A".) It turns out [TH] that, for a given wave 
mode, the wave vector k propagates along the bicharacteristics of a certain 
linear partial differential equation of the first order. When the latter equation 
is put into characteristic form, one obtains a Hamiltonian system, in which 
the Hamiltonian is none other than the dispersion relation W defining the 
given wave mode of the wave equation considered. Now, with certain pre¬ 
cautions which are made necessary by the existence of several wave modes, 
one may recover the wave equation from the dispersion relation alone. Hence 
one guesses that, under favourable conditions, one may associate a relevant 
wave equation with a classical Hamiltonian, as is done in QM. 

2.1 Dispersion equation and dispersion relations for a 
linear wave operator 

In order to explicitly formulate this interpretation in a general-enough case, 
it is necessary to generalize somewhat Whitham’s remarks El. especially in 
what regards the case of variable coefficients. The notion of a wave assumes 
that the wave function -0 is defined over an open domain D in an extended 
configuration space V = R x M, with M the A-dimensional configuration 
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space. Consider a linear differential operator P defined “on D” (in fact, P 
is defined on a suitable space of functions having D as common domain of 
definition). Let us assume a second-order operator for simplicity (this is not 
necessary, but it is enough for the application to QM): 

P0 = a 0 (X) + af(X)d^ + af (A)<9,A0, (1) 

where X, with coordinates 2^(0 < fi < N ), is the relevant point of the 
extended conhguration space V. Since the operator is linear, it is relevant to 
consider solutions of the equation 


P0 = 0 (2) 

in the form of sinusoidal waves: 

il>(X)=A{X)exp[id{X)], (3) 

where the phase 9(X) is a real function. For a function of the form (J3|) 
[whether it is a solution of @ or not], we define the wave covector 

K = (/v), K, = dffi. (4) 

Let a function of the form © have constant amplitude A and be such that, 
at the point A" considered, we have 


d v K ft {X) = d v d li e{X) = Q. (5) 

Substituting © into ©, and accounting for ©, one finds that a such func¬ 
tion obeys m at X if and only if 

U x (K).A = 0, (6) 

where K = K(X), and where 

n x (K) = a 0 (X) + i af(X)K, + i 2 a^{X)K,K V . (7) 

If one considers equation © for scalar functions, 'f’(X) G R or -0(A) G C, 
then the coefficients of P must be scalars, and © for one non-zero A is 
equivalent to Lfx(K) = 0. On the other hand, if 'il’(X) belongs to some 
finite-dimensional vector space E, say E = C m , then the coefficients of P 
are not necessarily scalars, but may also be matrices with m rows and m 
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columns. In that case, IIx(K) = 0 is equivalent to “ P.Ae 10 = 0 at X, 
for every A e E” (assuming ©). 1 The existence of real solutions K to 
the equation IIx(K) = 0 is precisely the condition under which, at the point 
considered X e V, the linear equation © with © is indeed a wave equation. 
One checks easily that the correspondence between the linear operator P and 
the polynomial function with variable coefficients II is one-to-one, also in the 
case with matrix coefficients. The inverse correspondence, from II to P, 
consists simply of the substitution 

K, — dji. ( 9 ) 

Suppose one is able to follow as function of X the different roots of the dis¬ 
persion equation n y(K) = 0, seen as a polynomial equation for the frequency 
u = —Kq, thus one is able to identify the different wave modes. Then it is 
possible to define different dispersion relations 

u = W{K u ,..,K n -X), (10) 

each of which gives the frequency — Kq as a function of the spatial wave 
(co)vector k = {K\, ..., K^), i.e. the spatial part of K, for the considered 
wave mode. For each of these functions W, an argument of Whitham (na, 
Sec. 11.5), reproduced in Ref. [T2J, proves that, if a wave function © is such 
that the wave covector 0 is a solution of Eq. (HDD, then the modification of 
the spatial wave vector is governed by a Hamiltonian system with Hamilto¬ 
nian IE: 2 if the value of k is given at some point A" = (i, x), we have the 

1 A fixed system of coordinates has been assumed given on V: clearly, the coefficients of 

P are coordinate-dependent. If one allows for coordinate changes, one finds m that II y 

is well-defined if and only if one can define a certain class of coordinate systems connected 

by “infinitesimally linear” changes, that is, 

d 2 x' p 

dx p dx v ^ ^ 

at the point X(xq) = X(x' Q p ) considered, and if one admits only those coordinate systems 
that belong to this class. In particular, if the space V is endowed with a pseudo-Riemannian 
metric 7 , a relevant class is that of the locally geodesic coordinate systems (LGCS) at X, 
i.e., 7 nv,p{X) = 0 for all p,v,p. It is elementary to check that this is stable by a change 
©■ Conversely, if [x p ) is an LGCS at X for 7 , and if one changes to coordinates x' v , the 
Christoffel symbols of 7 at A are, in the x' v 's\ T p p = If the new system 

is still an LGCS, they all cancel, hence the coordinate change must verify ©• 

2 Whitham did not give a precise definition of the dispersion W for equations having 
variable coefficients, although such are those for which the result m-m is the most 
interesting. 
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coupled evolution given by 


dW 
dxi ’ 


( 11 ) 


dk j 

dt, 


dx j _ dW 
dt dkj 




( 12 ) 


2.2 The classical-quantum correspondence 

When they invented wave mechanics, de Broglie j2DJ[2I] and then Schrodinger 
|22, 221 assumed essentially that a Hamiltonian system describes in fact the 
“skeleton” of a wave pattern associated with some linear wave equation, in 
exactly the same way as geometrical optics describes the trajectories of light 
rays, which are the skeleton of the underlying wave pattern. Geometrical 
optics corresponds to the “nil wave length” limit for which, in the neighbor¬ 
hood of any point X E V, the wave © may be considered as a plane wave, 
that is 

A ~ Constant and 59 ~ k.hx — u5t = K^Sx' 1 . (13) 

Thus, in this limit, Eq. © is satisfied and hence, by definition, substituting 
the sinusoidal wave function © into the wave equation © gives the disper¬ 
sion equation n_v(K) = 0. Hence, in the geometrical optics limit, the wave 
function must obey one of the possible dispersion relations oa, and there¬ 
fore the wave vector k must follow the Hamiltonian dynamics (HU)-©, if, 
in this limit, the trajectories have to be governed by a classical Hamiltonian 
H( p, x, £), one is thus led to admit that the wave equation, which is searched 
for, must have as one of its dispersion relations a function W whose Hamilto¬ 
nian trajectories are just the solution trajectories of the classical Hamiltonian 
H , from which one starts. In order to ensure this, the most natural way E! 
is to assume that the respective Lagrangians: A for W and L for H, are pro¬ 
portional. Denoting h the proportionality constant, one gets simultaneously 
the relations of QM that relate energy with frequency and momentum with 
wave vector: 


H = KW, or E — tux , 


(14) 





as well as the correspondence between a classical Hamiltonian and a wave 
operator. The latter correspondence is got by combining the correspondence 
m-m with © m Moreover, this interpretation provides a resolution 
to the ambiguity which is inherent in the correspondence, in the general case 
where the Hamiltonian contains terms that depend on both the (extended) 
“position 1 ' A" = (t, x) and the canonical momentum p. The rule to obtain 
the wave operator unambiguously is simple: just put the function of A" as a 
multiplying coefficient before the monomial in p - as the dispersion equation 
has to be a polynomial in p at fixed X 

There remains a difficulty in the classical-quantum correspondence, 
however: the classical Hamiltonian H (p, x, t) is, in general, not a polynomial 
in p at fixed X (just as the dispersion W(k;A) is, in general, not a poly¬ 
nomial in k). An exception is the Hamiltonian of a particle subjected to a 
potential force held in classical mechanics, 

H{p,x.,t) = £- + V(t,x), (16) 

2m 

which leads thus directly to Schrodinger’s equation. In a less particular 
case, it may still happen that the Hamiltonian is an algebraic function of the 
canonical momentum p (at fixed A"): there exists a polynomial Q(E, p; A), 
with complex coefficients, that cancels for E = H( p; A). For instance, for a 
relativistic particle, 

Q(E , p) = E 2 — p 2 c 2 — m 2 c 4 = 0 if E = H( p) (17) 

in the free case, and 

Q(E,p; X) = (E-qV) 2 -(p--A) 2 c 2 -m 2 c 4 = 0 if E = H( p;A)(18) 

c 

in the case of a charged particle (with charge q) in an electromagnetic po¬ 
tential (V, A). We may, of course, choose for Q the polynomial of the lowest 
possible order that cancels for E = H(p-,X). [This remark concerns the 
general algebraic case: in the case of a relativistic particle, it is obvious that 
Eq. (1T7I) [or (HU)] has the lowest possible order.] Then, in order to oper¬ 
ate Schrodinger’s wave mechanics so as to associate a linear operator with 
the classical Hamiltonian H , one may brutally apply to the algebraic equa¬ 
tion Q(E, p;A") = 0 the classical-quantum correspondence, i.e., (ITljt - dTHI) 
followed by ©• In the case of a relativistic particle, this leads immediately 


to the Klein-Gordon equation. It amounts to taking Eq. m i or m] as 
the corresponding dispersion equation, up to the h factor and to the sign of 
K 0 = —lu, i.e. [in Cartesian space coordinates (x*)] 

2 4 

, x o o me 

n(K) = - c 2 KiKi - — = 0 . (19) 

Let us select units such that h — c — 1 for simplicity, so that the correspon¬ 
dence m-m writes simply 

E = -A'o, Pi = IU , (20) 

and the dispersion W coincides with the Hamiltonian H. Then, let us define 

or) = ( 2 i) 

with here (Minkowski metric in Galilean coordinates) 

(<5V) = diag(l, —1, —1, —1), (22) 

whence ( g= diag(l, —1, —1, —1) = (c/ Mi/ ) in the present case. Then we 
may rewrite m in the following (generally-covariant) form: 

n(K) = - m 2 = 0. (23) 


We end this Section by noting that, if one admits that it is a wave 
structure which is the fundamental behaviour, then one may a priori expect 
that the classical-quantum correspondence is not always a sufficient tool to 
obtain a correct wave equation. However, in Subsect. 12.11 we associated one 
or several Hamiltonian systems to every linear wave equation. Conversely, if 
one starts from a Hamiltonian that is algebraic with respect to the canonical 
momentum p, we just associated a wave equation to it, whereas it is likely 
that no linear wave equation can be naturally associated with a non-algebraic 
Hamiltonian. It seems thus that classical Hamiltonian systems are actually 
more general objects than are linear wave equations, and that, therefore, 
any relevant wave equation should be obtainable from a Hamiltonian by the 
classical-quantum correspondence. But, in the case of a relativistic particle, 
where an algebraic Hamiltonian is indeed there, several wave equations may 
in fact be associated with it. 
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3 THE DIRAC EQUATION 

In the “algebraic” case described above, the algebraic equation [say Eq. (1T7I) ] 
will usually have other solutions than just the relevant Hamiltonian H( p; A"). 
Correspondingly, the wave equation associated by the classical-quantum cor¬ 
respondence will have too much solutions, many of which shall be irrelevant 
to the dynamics described by the Hamiltonian H —thus, in the case of a rel¬ 
ativistic particle, the Klcin-Gordon equation is correct, but it is not enough 
to specify the behaviour, in other words it has too much solutions. One may 
then think of a factorization of the polynomial in the algebraic equation: if 
this factorization would indeed occur, one might hope that one of the factors 
is indeed relevant. Thus, in the case of Eq. (1331) . we try to get: 

n(K) = -m 2 = (a 0 + i<AT #1 )(6 0 + ib\K v ). (24) 

[We put an i factor before the first-order coefficients af and h\ for conve¬ 
nience: this is in order that af and b\ be directly the coefficients of the 
first-order derivatives in the associated operators, see Eq. Q.] However, 
this factorization cannot occur with complex coefficients, since this would 
mean that Q (or n) is not the polynomial of the lowest possible order, con¬ 
trary to our choice. [If Q(E, p) = 0 for E = H( p), then, of course, one of 
the factors must cancel for E = i/(p).] Hence, the unknown coefficients in 
the rightmost side of (1331) have to belong to some larger algebra A containing 
the complex field C. Therefore, A does not need to be a commutative alge¬ 
bra, moreover the complex numbers shall be identified with the complex line 
C 1 a in A, since a larger algebra means, in particular, a vector space on C of 
dimension d > 1. This means that (1241) should be more correctly rewritten 
as 

(g^K^ - m 2 ) 1 A = (a 0 + iafK^bo + ib\K V ). (25) 

Since we are interested only in the zeros of n(K), we may multiply the right- 
hand side (r.li.s.) by bo.ay 1 on the left, 3 and thus we may assume that 

a 0 = b 0 . (26) 

The product decomposition (1331) is then equivalent to the following equations: 

_2<riA = -m +( 27 ) 

3 In doing so, we impose the additional condition that eto must have an inverse, as will 
be indeed the case, see Eq. EB -thus, we assume that m/0. 
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a^ao + a 0 &i = 0, 


( 28 ) 


—?7i 2 1a = ag. (29) 

From Eq. O, we see that we may impose that a o is (identifiable with) a 
complex number, ao G C 1a, and that it is then 

a 0 = eimlA, e = ±1. (30) 

With this, it follows from Eq. (I2HJ) that 

< = -K = Y- (3i) 

The relation (EZI) that must hold for the quadratic terms is then 

7 / V' + 7 V = 2<ri A . (32) 

This is the well-known relation defining the modified Clifford algebra. (The 
genuine Clifford algebra applies to the objects d° = 7 0 , d? = .) Thus, we 

have rewritten the original polynomial II(K) as the following product: 

n(K)i A = n 1 (K)n 2 (K), (33) 

with 

IIi(K) = eiml A + iYK^, n 2 (K) = eiml A - (34) 

The corresponding dispersion equation II (K) = 0 is satisfied as soon as either 
Ui(K) = 0 or II 2 (K) = 0. Either possibility is a dispersion equation in its 
own right, and thus, by the correspondence Q, is uniquely associated with 
a linear wave equation. If e = +1, the first possibility leads to the Dirac 
equation: 

(h^m - ml AY = 0, (35) 

while the second one leads to its well-known associate, 

(iY®n + ^Ia)^ — 0. (36) 

The reverse occurs, of course, if one chooses e = —1. 

In the case with an electromagnetic (e.m.) field, the correspon¬ 
dence ESI) means rewriting the algebraic equation m as the dispersion 
equation 

n c “(K) = (-/t 0 - qV) 2 - {K t - qA'){l<i - qA { ) - m 2 = 0 (37) 
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( 38 ) 


in Cartesian space coordinates. Let us define (for a given value of A") 

K = K 0 + qV{X), K\ = Ki - qA l (A), 

that is, in general coordinates, simply 

a; = K^ + qA^X) (A° = V, A M = g^A"). (39) 

We have II™ 1 (K) = fI(K / ). Therefore, the factorization of the polynomial 
dSZD in the iC M ’s, as the product of two first-order polynomials, is obtained 
by substituting K' for K into the factorization (1331) of the polynomial fl(K): 

1T^(K) 1 A = {dm\ K + i^K'^{eim\ K -i^ v K' v ) 

= [eiml A + iY l (K^ + gA M )][eiml A - %i'{K v + qA u )\. (40) 

We have again factorized the dispersion equation—this time, that valid with 
an e.m. field. As for the free case, we may apply then the biunivocal corre¬ 
spondence & f° either of the two polynomials on the r.h.s. of (HOD- With 
the first polynomial, and with e = +1, we get 

[ij + ic t A u) ~ = 0, (41) 

which is the Dirac equation in an e.m. held. 

Thus, we derived the original Dirac equation from a factorization 
of that polynomial which provides an algebraic equation for the classical 
Hamiltonian, instead of factorizing the Klein-Gordon operator. One might 
say that this is a variant of the original and usual derivation, rather than 
a totally new derivation. This variant insists, more than does the usual 
derivation, on the algebraic aspects of the Dirac equation. Note, for example, 
that the product decomposition assumed a priori (ESI) is a completely general 
product of two first-order polynomials, whereas the two operators assumed in 
the usual factorization of the Klein-Gordon equation ( e.g. Bjorken-Drell [5], 
Schulten |.6J) are quite particular. Moreover, the present derivation is based 
on our interpretation of the classical-quantum correspondence (Section |2J). 
Thus, the basic reason for which the Klein-Gordon equation is not the last 
word is that the algebraic equation m i or CEBD] obeyed by the Hamiltonian 
is second-order, hence it is tempting to try a factorization, with the hope of 
getting a more fundamental equation. Also, in this context, it is clear from 
the beginning that the correspondence (d-CSD applies to the Hamiltonian 
energy E = H (p, x, t ) with the corresponding canonical momentum p, so 
that the derivation for the free case extends to that with an e.m. held, in a 
fully justified way. Another benefit will appear in Section 0 
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4 TRANSFORMATION OF THE DIRAC EQUA¬ 
TION 

Let x ,fl = F^^x")), or in short X' = F(X), be any admissible tranformation 
of the space-time coordinates, the admissibility being a flexible notion which 
will be subjected to changes. We investigate the transformation of the wave 
function i/j, solution of the Dirac equation (|35|) or (SB, that corresponds to 
F. In this Section, we shall consider as admissible the linear transformations, 
and merely the ones which belong to some subgroup G of the group GL(4, R) 
of all possible linear transformations. Thus, in the standard study of the 
transformation of the Dirac equation, one considers G = 0(1, 3), the Lorentz 
group. We ask that, after any tranformation L e G, the value, in the new 
coordinates, of the wave function at a given event depend linearly on its value 
ip(X) in the old coordinates, because the Dirac equation is linear. Further, 
we ask that the corresponding linear operator S (which acts on the set of 
the values taken by the wave function) be the same for any event X , because 
here we are considering special relativity with its homogeneous space-time, 
this homogeneity being preserved by a linear transformation. Thus we ask 
that 

W) = S (L).^(X), (42) 

for some operator function of L, S — S (L). In fact, we know that the 
simplest realization of the algebra (H21) is provided by 4 x 4 complex matrices 
Y, thus A = M(4, C), so that the values ijj{X) of the wave function are 
elements of C 4 . The regular operators acting on the vector space C 4 form the 
group H = GL(4, C), identifiable with that of the inversible 4x4 complex 
matrices; but, of course, this does not mean that the matrices S = S (L) 
cannot be restricted to some smaller group. From (S3), it follows immediately 
(by considering the inverse transformation L' = L _1 or the composition 
L = L 2 .L 1 ) that we must have 

VL e G, S(L~ 1 ) = [S(L)]-\ S {L2.Lt) = S(L 2 ).S(L x ), (43) 

hence S must be a representation of G into H. Using (S3 and S3 1 in the 
Dirac equation S3), and multiplying on the left by S = S (L), we get 

(iS^^d^S -1 — m)ij)' = 0. (44) 

Since S' is a constant here, and since d/dx M = L^d/dx'" = L u ^d' u , it follows 
that 

[iL^SYS- 1 d' v - m)Y = 0. (45) 
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The same argument applies with an electromagnetic (e.m.) field (covector 
A p ), leading to 


[iL;SYS-' K + iqA'J) - mW = 0. (46) 


At this stage, the standard approach would state that the Dirac equation 
must be Lorentz-invariant, and that hence the new matrices appearing in 

m an d m , 

r = S = S(L), (47) 


must be equal to the starting matrices 7 ". This leads to the spinor represen¬ 
tation S = S S pi n or, which is defined for L G 0(1,3) and cannot be extended 
to GL(4, R) P3L TTi. n?Jl . 


Yes: the relativity principle demands that the equations are Lorentz- 
covariant —and this, without introducing extraneous quantities such as the 
velocity with respect to a preferred reference frame, ffowever, as is well- 
known, it is only for a scalar that this means invariance. For instance, 
consider the relativistic equation of motion of a charged test particle in an 
e.m. field, 

dU p 

m —— = qFhV u . (48) 

ds 

This equation is manifestly Lorentz-covariant; if one substitutes the “absolute 
derivative” for the usual one on the left, it even becomes generally-covariant. 
In it, the 4-velocity U p tranforms like a (four-)vector, and the e.m. field 
F p transforms like a mixed tensor. To deduce from this transformation 
behaviour some hints for the Dirac equation, we begin by remembering that 
the Dirac wave function has four components, ffence we may adopt the same 
condensed notation for Eq. (TIKI) as for the Dirac equation, thus U = (t/ M ), 


F = (Fi): 


m —— = qFU. (49) 

ds 

This makes it clear that, in the relativistic equation of motion of a charged 
particle, there is one matrix object F, which plays the role of a coefficient, 
very much as do the four 7 ^ matrices in the Dirac equation m - And this 
matrix F is not invariant in the transformation of this equation. Namely, 
after any linear transformation L G GL(4, R), thus x = L p x u , it transforms 
like this: 


w'v 

x V 


dx' p dx a 
dx p dx w a ’ 


(50) 
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that is, 


F' = LFL~\ 


(51) 


[Recall that this includes the case of a Lorentz boost: L G 0(1, 3).] Therefore, 
we are allowed to do the same for the Dirac equation, i.e., we are allowed 
to investigate a relativistic transformation of that equation in which the 
matrices would not be Lorentz-invariant. Moreover, account must be made 
for the fact that the ’s are not univoqucly fixed by the algebra (E2D: it is 
well-known that any similarity transformation 


^ = Si/), Y = SYS' 1 , S G GL(4, C) (52) 


is allowed. Now Eq. (SZI) says precisely that, up to the transformation of 
the partial derivatives, the new matrices are deduced from the old ones by 
the similarity transformation corresponding to the very linear transformation 
which affects the wave function. Moreover, we note that the new matrices 
(TT71) verify 

yyo = yysyysV (53) 

whence, by (1321) . the anticommutation relations 

7 Y + 7 Y- V^Ia, (54) 

with 

<T > = yy <y. (55) 

The latter, of course, is precisely the expression of the (contravariant) met¬ 
ric tensor after the general linear coordinate change characterized by matrix 
L G GL(4, R). In the case of flat space-time, and if one starts from a Galilean 
coordinate system, i.e. one in which the metric has the Minkowskian form 
(1221) . that Minkowskian form is conserved if and only if L G 0(1,3). This 
is true with the matrices (S3), independently of the chosen representation 
S—provided S is defined for L G 0(1,3). 

Let us summarize. If one considers linear coordinate transforma¬ 
tions L belonging to some subgroup G of GL(4, R), and if any representation 
S of G into GL(4, C) is available, then the free Dirac equation (|35j) . as well as 
that with an e.m. held, Eq. (SB, are form-invariant after the transformation 
defined by Eqs. (SB for the wave function and SB for the matrices q M , Eqs. 
SB and SB- If G D 0(1,3), the anticommutation relation (1321) is invari¬ 
ant {i.e. with the components of the metric being invariant) after a Lorentz 
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boost, Eqs. (151) and (155)1. The standard choice has been to impose that 
the matrices 7^ themselves are Lorentz-invariant, which leads to the spinor 
representation S S pi n0 r. This is a peculiar interpretation of the relativity prin¬ 
ciple, as: i) the (archetypically relativistic) equation of motion for a charged 
particle does contain a Lorentz- non- invariant matrix F, Eqs. (HID and (ED), 
and ii) there is no privileged set of matrices 7^ among the infinitely many 
sets satisfying the anticommutation relation (Si. 

An equally valid interpretation of the relativity principle allows us 
to select the simplest available representation: 

S = Identity, S (L) = L , (56) 

which is defined over the whole linear group, G = GL(4, R). Thus, from 
Eqs. (1421) and (1571) . we find that the wave function tranforms like an usual 
4-vector: 

<P'(x') = L4 (X), </>"- = Ly\ ( 57 ) 

and the Dirac matrices transform in the following way: 

i^ = ntLfL- 1 , ( 58 ) 

which resembles the transformation EH) of the e.m. field matrix, up to 
the fact that here there are four matrices, and, due to the transformation 
of the partial derivatives, they are “mixed” by the transformation. The 
transformation EZHSSl works as well for the Dirac equation in an e.m. 
field (1411) . It seems that little, if anything, is changed to the direct physical 
consequences of the Dirac equation: the point to be emphasized is that the 
equation , hence its solutions, are unchanged. Thus, the uon-relativistic limit 
of the equation with an e.m. held, leading to the Pauli equation involving the 
spin term; the exact solutions in a Coulomb field, including the splitting of 
the wave function into two components, each of which again includes a spin 
term, and leading to the energy spectrum of hydrogen-type atoms and to the 
explicit stationary states—all of this can be taken verbatim from Schulten 
0. Similarly, the propagator theory is unchanged, for it is based on the 
Green’s function of the Dirac equation, which we leave unchanged; therefore, 
we might also recopy the Bjorken-Drell |5 analysis. The Foldy-Wouthuysen 
transformation 0121 is in fact a change of the unknown (wave) function, thus 
it does not involve a coordinate change and can be taken as it is. It may be 
misleading that the wave functions which are solutions of the Dirac equation 
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are often called “spinors:” whether or not a function obeys the equation in 
a given coordinate system, is independent of the transformation behaviour 
on changing the coordinate system. The transformation behaviour is defined 
once one chooses the representation S in Eqs. (14211 and m - It would have 
been clearer if one would have reserved the word “spinor” to designate the 
particular representation S spinoi ., which leaves the 7 ^ matrices invariant. 

5 THE CASE WITH A GRAVITATIONAL 
FIELD 

In Sect. |2l the correspondence between a Hamiltonian and a wave equation 
has been, to some extent, justified, in the framework of an interpretation of 
that correspondence. To derive the Dirac equation from this correspondence, 
in the situation with a gravitational field, we have to ask if there is a Hamil¬ 
tonian for the motion of a test particle in a gravitational field. According 
to the foregoing interpretation, we need a Hamiltonian in the original sense 
of Hamilton’s mechanics, thus one in which the time coordinate t is inde¬ 
pendent of the position x in the configuration space, the latter being the 
three-dimensional (frame-dependent) space, not the space-time [TT]. (The 
same requirement is introduced, on somewhat different grounds, by Tagirov 
[25]-) This excludes the “super-Hamiltonian” H[{p^)^{x v )] flE j . I 11 Ref. [IT] , 
it was shown that there is indeed a Hamiltonian for the geodesic motion of a 

test particle in a static space-time metric g = (g^), and that it is given by 
4 

H{ p, x) = [goo(h 3k PjPk + rn 2 )} 1/2 , (60) 

the canonical momentum p being in fact the usual momentum, i.e. 

Pj = m r ) v h j kV k i (61) 

where v' J = (goo) -1 ^^/^ is the velocity in the static reference frame, mea¬ 
sured with local clocks, and where h = ( hjk ) is the spatial metric in that 

4 Recall that a static metric is one for which Eli 

9nv = guviix 1 )) and g oj = 0. (59) 

Bertschinger m states a Hamiltonian for geodesic motion in the case of a general metric, 
which Hamiltonian does coincide with pl|) in the static case. In Ref. E3, a Hamiltonian 
is written for the case that the metric is given in normal Gaussian coordinates. 
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frame and (h jk ) = , 7 „ = (1 — u 2 ) -1 / 2 being the Lorentz factor (with 

v 2 = hjkvhft). The time f is the static time coordinate, which is unique up to 
a constant factor [22 . Thus, we have a Hamiltonian satisfying the algebraic 
relation 

E 2 - gooih^pjPk + m 2 ) = 0 for E = H{ p, x), (62) 

so that we may apply the same method as in Sect. 01 We multiply the latter 
relation by g 00 = g accounting for the fact that lT k = gE ( j,k = 1,2,3) 
and g°i = 0. Using then the correspondence PD, we get the dispersion 
equation 

g^K^K u - m 2 = 0. (63) 

This is identical to the dispersion equation (H3 ) valid for a free particle in 
flat space-time. Therefore, just the same factorization W-B can be used 
as it is, leading to the same Dirac equation Wl) . One difference is that, 
in the case with a static gravitational held, the metric cannot be reduced 
to the Minkowskian form (|221) by a coordinate change preserving the static 
character of the metric , that is [22] 

x'° = ax°, x' j = <fd({x k )). (64) 

But it is only in “static-compatible” coordinates [he., ones in which the met¬ 
ric has the form PD] that the Hamiltonian PD does rule the motion. This 
means that, now, the Dirac equation is with “deformed” matrices, 
satisfying the anticommutation relation (El with a non-Minkowskian metric 
tensor g^ v . 

At this point, we have to ask in which coordinate systems it is 
allowed to write that Dirac equation. In the hat-space-time case, we have 
shown that the trivial representation dSHD may be used in the transforma¬ 
tion [©,(113)], which means that the wave function if transforms like a 
4-vector, Eq. m- Since the latter transformation is defined for any coor¬ 
dinate change, we may now extend (B21) to any regular transformation F of 
the space-time coordinates: 

il>'(X') = L(X).il>{X), L{X) = VF(X) {Lf = dx'^/dx u ). (65) 

Hence, we may study the transformation of the Dirac equation for a such 
general F, by redoing the reasoning that led to the transformed Dirac equa¬ 
tion PD, with here S = L(X). The only problematic step is that leading 
from dH to pd, precisely because here S = L depends on A". This step 
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cannot be made as is, unless we assume that the transformation F satisfies 
Eq. © at the event X(xq) = X(x'q) considered. If this condition is satisfied, 
we recover the Dirac equation (EH) in the new coordinates, though the Y 
matrices have changed to EHD- Thus, we find again, in the case of the Dirac 
equation, the general limitation which is inherent in our interpretation of the 
correspondence Hamiltonian - wave equation: namely, that one must con¬ 
sider coordinate changes satisfying condition (JBD — see Note 1 here, and see 
Ref. t T2] (or pm §2.1, for details. This limitation on the coordinate system 
to apply the correspondence is physical: consider the covariant form (1221) of 
the dispersion equation for a relativistic particle, say in a flat space-time. In 
any coordinate system, this is unambiguously associated with just one linear 
wave equation, by the correspondence ©: 

{g^d^dj, + m 2 )-0 = 0 . ( 66 ) 

If we started from Galilean coordinates, with (gY = diag(l, —1, —1, —1), 
this is indeed the Klcin-Gordon equation; but otherwise, e.g. with spherical 
space coordinates, this is a physically meaningless equation. Coming back 
to static gravitation, we have, therefore, to define a privileged class of static- 
compatible systems, exchanging by transformations that verify © . There is 
only one natural such class in the case of a general static metric: the systems 
for which the time coordinate x° is the static time t (defined up to a scale 
change), and which, moreover, are locally geodesic , at the spatial position x 
considered, for the spatial metric h in the static reference frame, that is, 

x° = at, hjkj(x.) = 0 (j, k, l G {1,2,3}). (67) 

Thus, we must impose that the gravitational Dirac equation has the form 
(1221) only in the coordinate systems satisfying conditions dSZD- But it is a 
characteristic feature of curved space that condition flEZh cannot be verified 
in an open domain. Hence, we must find the form taken by the equation in 
more general systems. 

6 GENERAL TRANSFORMATION OF THE 
DIRAC EQUATION WITH 4-VECTOR WAVE 
FUNCTION 

We transform the Dirac equation ESI), assumed valid (possibly with deformed 
matrices) in some coordinate system (x^), to a general coordinate system 
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(x w ), using the ^-vector transformation ED of the wave function. As al¬ 
ready seen, this leads first to Eq. ED with S = L(X). We get then, by 
differentiating L~ x if' (and since d M = L” &„)■. 

i r y w [d' u 'i/j' + L.(dlL- 1 )if)'] — mip' = 0, (68) 


where the y^’s are given by (ED- The new term writes more explicitly 

rpP 

MdiL-^r = Km{L-x\}r = m 

Using the transformation law that expresses the new Christoffel symbols Y'f a 
as function of the old ones Y p xe {e.g. Ref. jEU, Sect. 28, Eq. (25)}, this is 


[L.(dlL~ l 



dx' IJl dx x dx 9 \ 
dx p dx ,u dx" 7 xe ) 


V a ■ 


(70) 


Thus, if one assumes that (El) is valid in a “freely falling” system, such that 
Qfiu, P = 0 for all /i, v and p, and hence with all Y p xe zero, he obtains from (El) 
and (El the following form of the equation in the general system ( x' v )\ 


(i-TK -m)f = 0, (D'„fr = W = W" + 0/>" (71) 


In words: the standard way of adapting the Dirac equation to gravitation, 
i.e., via the equivalence principle, is compatible with the 4-vector transfor¬ 
mation of the wave function—it leads to write the equation with the usual 
covariant derivative of 4-vectors, and with y M matrices that change according 
to (IHH1) after a coordinate change. 

However, our specific assumption is that the Dirac equation ED is 
valid, indeed with deformed y M matrices, in a static-compatible system veri¬ 
fying g —because, in any such system, we derived ED from the classical 
Hamiltonian ED- Using Eqs. El) and El, we may rewrite this in any new 
coordinate system (x' u ) as 


A} — m) ij)' 


0, (A>y = « 


-p/fL 
L V(T 


dx'" dx x dx 9 p \ /CT 
dx p dx' v dx' a xe ) 


(72) 

In the static system (aU) verifying ED, the non-zero terms of the connection 
are (e.g., cf. Ref. 2T)i. §3.2) 


p0 _ p0 _ 

1 o j - 1 j o - 


f 9oo,j 

2 Poo 


r J oo — g ^ 9oo,k- 


(73) 
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The second term inside the bracket on the r.h.s. of Eq. CZh just transports 
this old connection to the new system {x' v ), as a ( 2 ) tensor: since the connec¬ 
tion does not cancel in the system (x M ), this transported connection cannot 
cancel as a whole, in any other system ( x' u ). If we take a freely-falling system 
as the new system (x >v ), we have all zero. Hence, the equation (E2D 1 ob¬ 
tained in this system is not the usual Dirac equation —in contrast with 

the gravitational Dirac equation obtained from the equivalence principle, be 
it that based on the 4-vector transformation [Eq. (I7T1) ] or the standard equa¬ 
tion Q 0. Thus, Eq. |7^i is not equivalent to the extension of the Dirac 
equation obtained from the equivalence principle, independently of the spinor 
or vector transformation of the wave function which is chosen for the latter. 
Let us now consider as the new system (x ,v ) a static-compatible one, thus 
one deduced from ( x by a purely spatial change (IH3lb . [We may forget the 
trivial scale change mi-} After the tensorial transport of the connection, 
defined above, the non-zero terms turn out to keep the same expression [the 
right-hand sides in ®] in the system ( x ,v ). We may then rewrite (A^')^ 
explicitly, without any reference to the old system (and hence omitting the 
primes): 


(AoV)" = <1>% - 

(74) 

z fl'oo 

(AW0° - </■" 

(75) 

* 9 00 

(A o 00 = 0f o - ^h jk g 0 o,k^°, 

(76) 

(A fc 0) J = 00, 

(77) 


where 00 is the covariant derivative of a 4-vector, Eq. <EH>2. Thus, in any 
static-compatible coordinate system, we may write our version of the Dirac 
equation in a static gravitational held as 

— m) 0 = 0, (78) 

where the components (Aj,0) M are given by Eqs. (I74[) - (I77|) . This is covariant 
by any static-compatible coordinate change (ED- 


21 




7 CONCLUSION 


We use an interpretation of the classical-quantum correspondence |T2j based 
on the mathematical relationship between a linear wave operator and its dis¬ 
persion relation(s), and also based on the idea of de Broglie and Schrodinger, 
according to which classical Hamiltonian mechanics describes the skeleton 
of an underlying wave pattern. Integrating these two aspects provides a 
solution [HI to the ambiguity of this correspondence, in the case that the 
Hamiltonian contains mixed terms with position x and canonical momentum 
p. That case is relevant to the situation with a gravitational held m 

However, for a relativistic particle, the classical Hamiltonian H is 
not a polynomial in p (at fixed x and t), but an algebraic function of it. The 
corresponding algebraic relation has another solution, which is E' = —H, 
and which does not describe the same dynamics as that obtained with H. 
This leads to the idea that, in factorizing that algebraic relation, one might 
get a more fundamental wave equation. The factorization does occur with 
coefficients in the larger algebra of the 4x4 matrices, and leads to the Dirac 
equation. Of course, it turns out that it also has negative-energy solutions, 
but nevertheless it is indeed more fundamental in that it describes more im¬ 
portant particles than does the Klein-Gordon equation. The main interest of 
the proposed approach is that the same derivation applies to three different 
cases: that of a free particle, that of a particle in an e.m. held, and that 
of a particle subjected to geodesic motion in a static gravitational held. In 
the latter case, there is indeed a Hamiltonian. Thus, our static-gravitational 
version of the Dirac equation is derived from the classical Hamiltonian af¬ 
ter factorization of the dispersion equation [Eqs. (1331) and (Hull) ], just in 
the same way as we derive the hat-space-time Dirac equation. Hence, this 
gravitational Dirac equation follows as directly from wave mechanics as does 
the hat-space-time Dirac equation, whereas the standard (Fock-Weyl) ver¬ 
sion just extends the hat version using the equivalence principle. It also 
implies that all solutions of this proposed version are solutions of the static- 
gravitational Klein-Gordon equation associated GH with the dispersion fSD - 
In contrast, it is not usually the case that a solution to the Fock-Weyl exten¬ 
sion of the Dirac equation obey either of the generally-covariant extensions 
of the Klein-Gordon equation. This confirms that the proposed gravitational 
Dirac equation is not equivalent to the standard extension, as was proved 
after Eq. (1731) . 

A surprising result of this work is that the standard 4-vector trans- 
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formation may be applied to the wave functions ip obeying the Dirac equation— 
provided one accepts that the matrices are not invariant in the transfor¬ 
mation of the equation. Oviously, an equation involving a matrix object may 
be covariant without the matrix being invariant: a relevant example is the 
relativistic equation of motion for a charged particle in an e.m. field, Eqs. 
© and m Furthermore, the set of the matrices is not uniquely defined, 
since the anticommutation relation (E2J) admits an infinity of solution sets. 
In the case of curved space-time, a coordinate change does anyway change 
the 7 ^ matrices, since it changes the metric coefficients in that relation (1321) . 
Thus, the proposed 4-vector behaviour of ip, associated with the transforma¬ 
tion dSHD for the 7 ^’s, which does leave the Dirac equation form-invariant, is 
formally admissible. 5 As to the observational aspect: we argue that, since 
the flat-space-time Dirac equation itself is the same, nothing is changed to its 
closest applications (the emergence of spin and the energy levels of hydrogen- 
type atoms), neither to those based on the Feynman propagator J5J, which 
is a Green’s function of the Dirac equation (ESD or (EH). Since the Dirac 
equation (especially after it is adapted for a quantum held) has applications 
in many parts of basic physics, however, it is hard to state in advance that 
no physical consequence is changed. 

In the static-gravitational case, the direct application of the classical- 
quantum correspondence, which has been done in Sect. [ 3 ] would not make 
sense with the usually-used (spinor) transformation. Indeed, the latter is 
restricted to the Lorentz group, hence it could not be used to rewrite the 
equation in any static-compatible system, by using a general spatial coordi¬ 
nate change, as it had to be done in Sect. |bl It remains to investigate the 
differences that could occur in the weak-held limits of the two gravitational 
extensions of the Dirac equation: the present one (limited to a static held), 
and the standard one mms], for which investigations of the weak-held limit 
already exist (Refs. .'IT |32], and references therein). Perhaps, such differ¬ 
ences might be observable in the future, e.g. in experiments on ultra-cold 
neutrons, such as transmission measurements through a horizontal slit in the 
Earth’s gravitational held |331l33j. 

5 The possibility of keeping the 4-vector behaviour is alluded to in Ref. [T7J, p. 715. 

A “four-vector behavior of the Dirac bispinor” is vindicated by Bell et al. ESI, in the 
framework of quaternion Dirac equation. But Eq. (3) of Ref. is not the standard 4- 
vector transformation [Eq. (15711 here], and when the former applies, it applies to transform 
an equation which is not equivalent to Dirac’s, in contrast with the present study. In the 
present work, the 4-vector transformation arose in the study of the gravitational case. 
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